Implementing O(N) N-Body Algorithms Efficiently in Data-Parallel Languages

نویسندگان

  • Yu Hu
  • S. Lennart Johnsson
چکیده

The optimization techniques for hierarchical O(N) N{body algorithms described here focus on managing the data distribution and the data references, both between the memories of diierent nodes, and within the memory hierarchy of each node. We show how the techniques can be expressed in data{parallel languages, such as High Performance Fortran (HPF) and Connection Machine Fortran (CMF). The eeectiveness of our techniques is demonstrated on an implementation of Anderson's hierarchical O(N) N{body method for the Connection Machine system CM{5/5E. Of the total execution time, communication accounts for about 10{20% of the total time, with the average eeciency for arithmetic operations being about 40% and the total eeciency (including communication) being about 35%. For the CM{5E, a performance in excess of 60 MMop/s per node (peak 160 MMop/s per node) has been measured.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implementing O(n) N{body Algorithms Eeciently in Data Parallel Languages (high Performance Fortran) Implementing O(n) N{body Algorithms Eeciently in Data Parallel Languages (high Performance Fortran)

O(N) algorithms for N{body simulations enable the simulation of particle systems with up to 100 million particles on current Massively Parallel Processors (MPPs). Our optimization techniques mainly focus on minimizing the data movement through careful management of the data distribution and the data references, both between the memories of diierent nodes, and within the memory hierarchy of each...

متن کامل

Implementing O(N) N-body Algorithms Efficiently in Data Parallel Languages (High Performance Fortran)

O(N ) algorithms for N{body simulations enable the simulation of particle systems with up to 100 million particles on current Massively Parallel Processors (MPPs). Our optimization techniques mainly focus on minimizing the data movement through careful management of the data distribution and the data references, both between the memories of di erent nodes, and within the memory hierarchy of eac...

متن کامل

Non-zero probability of nearest neighbor searching

Nearest Neighbor (NN) searching is a challenging problem in data management and has been widely studied in data mining, pattern recognition and computational geometry. The goal of NN searching is efficiently reporting the nearest data to a given object as a query. In most of the studies both the data and query are assumed to be precise, however, due to the real applications of NN searching, suc...

متن کامل

Parallel Generation of t-ary Trees

A parallel algorithm for generating t-ary tree sequences in reverse B-order is presented. The algorithm generates t-ary trees by 0-1 sequences, and each 0-1 sequences is generated in constant average time O(1). The algorithm is executed on a CREW SM SIMD model, and is adaptive and cost-optimal. Prior to the discussion of the parallel algorithm a new sequential generation with O(1) average time ...

متن کامل

Efficient Data Parallel Implementations of Highly Irregular Problems

This dissertation presents optimization techniques for efficient data parallel formulation/implementation of highly irregular problems, and applies the techniques to O(N) hierarchical N–body methods for large–scale N–body simulations. It demonstrates that highly irregular scientific and engineering problems such as nonadaptive and adaptive O(N) hierarchical N–body methods can be efficiently imp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Scientific Programming

دوره 5  شماره 

صفحات  -

تاریخ انتشار 1996